BTCC / BTCC Square / Global Cryptocurrency /
Large Reasoning Models Struggle with Instruction Adherence, Study Reveals

Large Reasoning Models Struggle with Instruction Adherence, Study Reveals

Published:
2025-10-23 01:45:02
17
2
BTCCSquare news:

Together AI's latest research exposes a critical flaw in large reasoning models (LRMs), revealing their inconsistent ability to follow instructions during complex reasoning tasks. The study introduces ReasonIF, a 300-problem benchmark that evaluates multilingual reasoning, formatting constraints, and word limits.

While LRMs demonstrate competence in final outputs, their reasoning processes frequently deviate from specified instructions. This adherence gap widens with task complexity, raising fundamental questions about AI controllability in high-stakes applications.

The findings arrive as AI systems increasingly handle sensitive financial operations, from algorithmic trading to risk assessment. Market participants should note these reliability concerns when implementing LRM-powered solutions in cryptocurrency analytics or automated trading systems.

|Square

Get the BTCC app to start your crypto journey

Get started today Scan to join our 100M+ users

All articles reposted on this platform are sourced from public networks and are intended solely for the purpose of disseminating industry information. They do not represent any official stance of BTCC. All intellectual property rights belong to their original authors. If you believe any content infringes upon your rights or is suspected of copyright violation, please contact us at [email protected]. We will address the matter promptly and in accordance with applicable laws.BTCC makes no explicit or implied warranties regarding the accuracy, timeliness, or completeness of the republished information and assumes no direct or indirect liability for any consequences arising from reliance on such content. All materials are provided for industry research reference only and shall not be construed as investment, legal, or business advice. BTCC bears no legal responsibility for any actions taken based on the content provided herein.